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ABSTRACT 

A study examined the feasibility of developing a cost 
effective sampling technique which would estimate the mean number of 
patrons using the Purdue University General Library during one 
semester. The technique employed , called random sampling without 
replacement, meant that, from the total population of days in the 
Fall Semester, particular days for counting would be chosen at random 
and used only once. Confidence bounds and cost figures were estimated 
for each sample size, based on the 1972 statistics of the library, 
Time-and-^motion study techniques were employed in the re-examination 
of functions currently performed by the exit-checkers at the library, 
Check-»out ,and esit procedures were modified to increase efficiency, 
and rough estimates computed of the daily cost of counting exiting 
patrons under the old and new procedures. In a university setting, 
library patronage is subject to such variables as vacations and exam 
tiJies; therefore a stratified random sampling technique, where each 
day of the week was selected a given number of times, might yield 
lower error estimates than those computed by the ••pure" random 
sampling method used in this study, (SL) 
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Introduction 

Without* question, current information and data on the dynamics of a 
large university libra.ry system is esjsential for its proper management and 
administration. However, increasingly, library administrators are faced 
with the need for more data to corapl-ote internal comparisons, to compare 
one library system with another, and/or to satisfy e:rternal requests for 
varied and more detailed data. These small but steady demands for additional 
data may cause once effectively collected data passed through efficient 
communication channels within the library system to degenerate. 

In many instances, staff continue to spend sigtiificant amo\mts of time 
in data collection routines for which they may show little concern or have 
little knowledge of why such a task is \mdertaken. Complex and unwieldy 
data collection policies and procedures often become ingrained and accepted 
in library systems. Thus, those sensitive to the high costs incurred 
in such data collection routines are staggered by the efforts required to 
bring efficiency and logic to these data collection activities. ! 

Predictably, library administrators reflect growing concern about the 
efficiency of such activities and the Impact these traditional policies 
and procedures place on other pressing task priorities. Thus, they search 
for more efficient data collection techniques. Can a less expensive, yet 
reliable, procedure be established which will provide an adequate estimate 
of the n\mber of patrons utilizing any given library for any given time 
period? 

Cfbjectives 

She objective of this study is to develop, a cost effective sampling 
tedinique which will provide estimates of the number of patrons utilizing 



the Purdue General Library during the Fall Semester of 1975. Confidence 
bounds will be established for various sample sizes, and cost figures vdll 
be estimated based on fimctions performed by exit checkers. 

Methodo2.ogy 

Bie current problem concerns whether or not a sampling technique can 
be developed that will yield an adequate estimate of the mean nvanber of 
patrons who will utilize the Purdue General Library on a daily basis 
during the Fall Semester of 1975. In this case, a sanqple can be defined 
as that portion of the population which vrill be used to obtain an estimate 
of the mean number of patrons who daily utilize the General Library during 
the 1975 Fall Semestor time period. For purposes of this study, the popu- 
lation frcdP. which the sample will be drawn will consist of the total days 
that the Geaeral Library is open during the Fall Semester of 1973- A 
sampling technique will be employed which Dixon and Massey (I969) refer to 
as random sampling without replacement. As applied to this problem, this 
technique is one in which the particular calendar days selected (from the 
total populfition of days available during the Fall Semester) for counting 
the nmber of patrons utilizing the library would be chosen at random and 
without replacement. Briefly, sainpling without replacement means that an 
item (in this case, a particular day) could be chosen only once, e.g., if 
a particular calendar day has been selected and used by a random process, 
it cannot be selected and used again for it has not been placed back in the 
original population. 

Let us assoime that a large number of equal sized samples of Fall Semester 
days were drawn without replacement from the population of calendar days. 



The niuriber of calendar days in each of these samples is denoted by N» The 
distribution of the mean number of patrons utilizing the Gr^neral Library 
in each of these samples has a standard deviation. This standard deviation 
is known as the standard error" of the mean. The standard error of the 
mean is gi^'-en by the following formula: 



Where S.D. of x = standard error of the mean; in this case, the standard 

deviation of the sample distribution of means of the 
daily number of patrons who utilize the General Library. 

2 

(T = the population variance; in this case, the variance of 



days the General Library is open during the Fall 
Semester of 1973- 
N = size of sample; in this case, the total number of days 
chosen for sampling the number of patrons utilizing the 
General Library. 



Since the standard deviations cannot be known beforehand, it is necessary 
to obtain this estimate by using a known popixlation wfcoh similar parameters. 
An important assxm^tion here is that the standard deviation of the daily 
mean number of patrons utilizing the General Library dtiring the Fall Semester 
of 1975 is approximately equal to the standard deviation of the number of * 
patrons utilizing the General Library on a daily basis during the Pall Semes- 
ter of 1972. This assumption must be made for it is necessary to have an 




the daily number of patrons utilizing the library for 



one semester. 



size of the population; in this case, total number of 



estimate of the population standard deviation before error estimates can be 
cc3anputed, Thufi, the standard deviation of the daily number of patrons 
utilizing the General Library dxiring the Fall Semester 1972 was used to 
develop an estimate of the population standard deviation. It was found that 
the mean of the Fall Semestei* 1972 daily data was IkkS and the standard 
deviation was 72H, It i? possible that the size of tjie standard deviation 
for the Fall 1973 data could differ appreciably from the size of the stan- 
dard deviation for the Fall I972 data. However, as a check, the Fall I97I 
standard deviation of patrons daily utilizing the General Library was 
computed, and it was found to be comparable with the Fall 1972 figure. Thus, 
this evidence siqjports the assumption of yearly comparability of standard 
deviations. 

. ' A review of the study of Purdue Libraries Statistics (lMRU-01-72) 
was made with particular attention focused on the section conce:rning the 
General Library, patrons leaving the library, pages A-10 throoigh A-13. 
Tills was followed by a re- examination of functions performed by exit checkers 
both at the time of the original statistical stucJy and, recently, under 
current conditions. Time-and-motion study techniques were employed in the 
re- examination of functions currently performed by these personnel. 

The following methods were employed to determine a rougji estimate of 
the daily costs incurred under both past and current conditions for count- 
ing the number of patrons exiting the library. Data reported in the 1972 
statistical study were utilized to determine costs for past conditions.' 

Total anjiual seconds Total number of 

spent in counting 7 patrons utilizing = Average time per 
patrons exiting the the library per transaction 

library year 
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Total ntariber of 
patrons utilizing the 
library per year 



Total annual hours 
spent in counting 
patrons exiting the 
library 

Hours per day 



Mean number of 
transactions 
per hour 



Total days 
per year 



X Hourly rate 



For current conditions, the methods were as follows 

Total number of patrons 7 Mean number of = 
utili,?;ing the library transactions 
per year per hour 



Total annual hours 
spent in counting 
patrons exiting 
the library 

Hour^s per day 



Total days of 
the year 



X Hourly rate 



Total annual 
hours spent in 
counting patrons 
exiting the 

' library 

Hours per day 



Daily cost 



Total annual 
hours spent in 
counting patzKms 
exiting the 
library 

Hours per day 



Daily cost 



Results 



Ta'Kle I presents the results in summary form. Also, this table illus- 
trates the size of the error which Mght occur for various sample sizes 
selected to estimate the daily mean number of patrons utilizing the General 
Library during the Fall Semester of 1975 • In making the computations in 
Table I, it was assumed that 110 total days constituted a semester. This 
asstanption was based on the fact that the General Library operated I09 
days during the J'all Semester 1972. Minor deviations f!ram this figure would 
not significantly alter the values in Table 1. 

The first column (left to right) of Table I indicates the sample size 
or the number of days that counts might be made of the number of patrons 
utilizing the General Library. The second column is the error at the 68^ 



confidence level, and the third colximn is the error at the 95^ confidence 
level. These figures estimate within plus or minus bcainds the daily mean 
error that would be eocpected in the number of patrons utilizing the Genisral 
Library for the Fall Semester of 1975 by using the sample size selected in 
the first column. For example, if a sample of UO days were to be selected 
and the 95^ confidence interval was used, one could determine the expected 
error in the nximber of patrons utilizing the^General Library for the Pall 
Semester by the following means: Multiply the confidence interval foxrnd 
in Table I by t?ie number of days in the^^ semester. In this exaiiiple, the 
figures would be i l83 times 110 which equals i 20,150, This number indi- 
cates that 95 times out of 100 the estimate of the total number of patrons 
exiting the General Library during the Fall Semester would be within 
t 20^150 of the "true" total number of patrons exiting the General Library 
dviring that semester. Finally, to the extent that the Fall 1975 standard * 
deviation is greater than the Fall I972 standard deviation, the error will 
be greater than the Table I entry. Conversely, to the extent that the Fall 
1975 standard deviation is less than the Fall 1972 standard deviation, the 
en:or will be less than the Table I entry. 

The fourth coltamn contains estimated cost figures for each sairrple 
size. These cost figures are based on the data reported in the July 1972 
Study of Purdue Libraries Statistics (lMRU-01-72). Since the General 
Library now operates only two exits, it would be noted that the data for 
past procedui'^es, e.g^ column four, have not been interpolated to reflect 
costs for two exits rather than for four. In the original cost study, it 
was reported that in the General Library, 58 staff members spent a total of 
5212. 25 hours collecting statistics on the number of patrons utilizing the 



library at an annual cost of $7,031.95 or roughly $1.5? for each hour the 
library was open. As illustrated below, these figures were broken down 
further to reflect average daily costs incurred for counting patrons exit- 
ing through foxxr exits under past procedures. 



Total annual seconds 
spent in counting 
pa.trons exiting the 
library 

11, 5614-, 100 

Total number of 
patrons utilissing 
the library per year 

1,250,000 

Total annual hours 
spent in counting 
patrons exiting the 
library 

35215 

Hours per day 
8.8 



Total number of 
patrons utilizing 
the library per 
year 

1, 250,000 

Mean number 
of transactions 
per hour 

389 

Total days 
per year 



: 565 

■ Hourly rate 
X $2,00 



Average time per 
transaction 



= 9-25 seconds 

Total annual hours 
spent in counting 
• patrons exiting 
the library 

= 3213 

Hours per day 



= 8,8 

Daily cost 
= $17,60 



However^ since the original cost study, there has been a significant change 
in number of exits and functions performed by personnel responsible for 
counting patrons leaving the library. Currently there are two exits, whereas, 
in the past there were four. Also, in the past, these personnel were primarily 
responsible for, in order of priority, (a) checking out books and other^ 
circulation functions, (b) checking briefcases, i.e., making sure all books 
that left the library were checked out, and (c) counting the number of 
patrons exiting the library. Currently these personnel are primarily respon- 
sible for (a) checking briefcases, i.e., determining that all books exiting 
the library have been checked out properly, and (b) counting the number of 
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patrons who exit the library. Also, the way they approach the task of count 

\ 

• - \ 
ing has changed substantially. The counter is now held in the checker ♦s 

hand on a nearly continuous basis as they check briefcases, etc. In the 

past, they would lay the counter down to check out books and hsmdle other 

circulation. duties and they they would have to retrieve the counter and 

count, or estiuiate a count, of the number of patrons who had just exited. 

It is quite likely that under past circumstances, the coxmting procedure was 

significantly more time consuming than it is under current procedures. Thus 

as shown in column five, the recent time- and-mot ion study revealed that 

personnel counting patrons exiting the library were spending approximateily 

one second per transaction to complete this task while previously they 

reported spending roughly 9-25 seconds per transaction. Thus, average daily 

costs under current procedures used to count patrons exiting the library 

break down as follows: 



Total number of patrons 
utilizing the library 
per year 



1,250,000 

Total annual hours 
spenij in counting 
patrons exiting 
the library 

Hours per day 
^95 



Mean number of 
transactions 
per hour . 



5600 

■ / 

Total days of 
the year ^ 



365 

Hourly rate 
$2.00 



Total annual 
hoiars spent in 
counting patrons 
exiting. the 
library 

Hours per day 



'.95 

Daily cost 
$1.90 
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Conclusions and RecoHgnendations 

Several conclusions may be drawn frcm this study. Perhaps, the use of 
an alternate sainpling method might reduce the size of the error estimates 
reported in Table I. One method would be a stratified random sampling 
technique which might specify that each day of the week be included an equal 
number of times in the sample. For example, if the sample size were 42 
days, then each day of the week would be selected 6 times for inclusion in 
the sample. These h2 days would be selected from the total popixlation of 
days in the sraiester. However, the use of this technique does not allow the 
utilization of a readily derived mathematical function to estimate the error. 
When no concise error estimates are required beforehand, the use of the — 
stratified random sampling technique is a possibility. 

Library use is a c^joplex process? involving a myriad of unidentified 
variables. Many events including patron utilization of a large university 
library do not occur on a purely random basis. There are periods of intense 
utilization followed by periods of little use of library facilities. Semester 
mid- terms, finals, weekends and holidays are examples of significant factors 
- which must be accounted for in predicting total yearly use based on sampling 
techniques. Therefore, in the example presented in this paper, the strati- 
fied random sampling technique would likely yield lower error estimates 
thajQ those foiind in Table I where "pure" random sampling was employed. 

After studying the data, the following recommendations can be made: 

lo time-and-motion study of patron count procedures in departmental 
libraries needs to be made based on methods used to develop cost data for 
column 5 of Table I. This would provide a further look at the costs of 
data collection in the departmental libraries. 



11 

2. If costs still appear to be unusually large, then explore the 
advisability of employing stratified sajnpling techniques in the departmental 
libraries, 

3. Unless the job function for the checkers varies significantly from 
current conditions, it is recommended that patrons be counted on a continuous 
basis as they exit the General Library. 
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